Genome-wide generalized additive models

نویسندگان

  • Georg Stricker
  • Alexander Engelhardt
  • Daniel Schulz
  • Matthias Schmid
  • Achim Tresch
  • Julien Gagneur
چکیده

13 Chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) is a widely used 14 approach to study protein-DNA interactions. To analyze ChIP-Seq data, practitioners are 15 required to combine tools based on different statistical assumptions and dedicated to spe16 cific applications such as calling protein occupancy peaks or testing for differential occu17 pancies. Here, we present GenoGAM (Genome-wide Generalized Additive Model), which 18 brings the well-established and flexible generalized additive models framework to genomic 19 applications using a data parallelism strategy. We model ChIP-Seq read count frequencies 20 as products of smooth functions along chromosomes. Smoothing parameters are estimated 21 from the data eliminating ad-hoc binning and windowing needed by current approaches. 22 We derived a peak caller based on GenoGAM with performance matching state-of-the-art 23 methods. Moreover, GenoGAM provides significance testing for differential occupancy with 24 1 . CC-BY 4.0 International license peer-reviewed) is the author/funder. It is made available under a The copyright holder for this preprint (which was not . http://dx.doi.org/10.1101/047464 doi: bioRxiv preprint first posted online Apr. 6, 2016;

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

GenoGAM: genome-wide generalized additive models for ChIP-Seq analysis

Motivation Chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) is a widely used approach to study protein-DNA interactions. Often, the quantities of interest are the differential occupancies relative to controls, between genetic backgrounds, treatments, or combinations thereof. Current methods for differential occupancy of ChIP-Seq data rely however on binning or sliding window...

متن کامل

A generalized sequential Bonferroni procedure using smoothed weights for genome-wide association studies incorporating information on Hardy-Weinberg disequilibrium among cases.

BACKGROUND/OBJECTIVES For genome-wide association studies (GWAS) with case-control designs, one of the most widely used association tests is the Cochran-Armitage (CA) trend test assuming an additive mode of inheritance. The CA trend test often has higher power than other association tests under additive and multiplicative disease models. However, it can have very low power under a recessive dis...

متن کامل

Estimation and Inference in Generalized Additive Coefficient Models for Nonlinear Interactions with High-Dimensional Covariates.

In the low-dimensional case, the generalized additive coefficient model (GACM) proposed by Xue and Yang [Statist. Sinica16 (2006) 1423-1446] has been demonstrated to be a powerful tool for studying nonlinear interaction effects of variables. In this paper, we propose estimation and inference procedures for the GACM when the dimension of the variables is high. Specifically, we propose a groupwis...

متن کامل

Using generalized additive (mixed) models to analyze single case designs.

This article shows how to apply generalized additive models and generalized additive mixed models to single-case design data. These models excel at detecting the functional form between two variables (often called trend), that is, whether trend exists, and if it does, what its shape is (e.g., linear and nonlinear). In many respects, however, these models are also an ideal vehicle for analyzing ...

متن کامل

به‏کارگیری مدل جمعی‏تعمیم‏یافته در تعیین نوع ارتباط عوامل خطر رتینوپاتی در بیماران دیابتی شهر تهران

  Background : One of the most important complications of diabetes, is diabetic retinopathy that causes the blindness of 10,000 people every year. Different researches have been done on retinopathy risk factors in diabetic patients. This study was carried out to check the type of relationship between retinopathy risk factors and the condition of temptation it with generalized additive models. T...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016